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NEW PERFORMANCE GUARANTEES FOR THE GREEDY 
MAXIMIZATION OF SUBMODULAR SET FUNCTIONS 

JUSSI LAITILA AND ATTE MOILANEN 


Abstract. We present new tight performance guarantees for the greedy 
maximization of nondecreasing submodular set functions. Our main re¬ 
sult first provides a performance guarantee in terms of the overlap of the 
optimal and greedy solutions. As a consequence we improve performance 
guarantees of Nemhauser, Wolsey and Fisher (1978) and Conforti and 
Cornuejols (1984) for maximization over subsets, which are at least half 
the size of the problem domain. As a further application, we obtain a 
new tight performance guarantee in terms of the cardinality of the prob¬ 
lem domain. Approximation and Cardinality and Convex optimization 
and Greedy algorithm and Maximization and Steepest ascent 


1. Introduction 

Let A be a finite set, X = {xi,..., Xn}, and let T be an integer such that 
0 < T < n. We consider the cardinality-constrained maximization problem 

(1) max{f{S):\S\ = T,S CX}, 

where /: 2^ —>■ M_|_ is a submodular set function. Recall that / is submodular 
if 

(2) f{S) + f{R)>f{SUR) + f{SnR) 

for all S', i? C A; see, e.g., [16) . We further assume that / is nondecreasing; 
fiS) < /(-R) for all S G R, and, without loss of generality, that /(0) = 0. 
We consider the following well-known greedy algorithm for solving problem 

m- 

Algorithm A. 

: Step 0: Set So = 0- Go to Step 1. 

: Step t {1 < t < T): Select any xt G St-i such that 

f{St-i U {xj) = max{/(S't_i U {x}): x € A \ St-i}. 

Set St = St-i U {xt}. Go to step t + 1. 

: Step T -|- 1: Set Sgr = St- Stop. 
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Algorithm [Xl has been extensively studied in the literature. By the Rado- 
Edmonds theorem m or 0), it finds an optimal solution when / is an 
additive set function, i.e., when ([2]) holds with an equality for all S,R C X. 
Nemhauser, Wolsey and Fisher |16| (see also |3], |15) i gave the following 
performance guarantee for Algorithm lAl for nonadditive functions /: 

(3) ^ 2 ^ - I) 

where Sopt is an optimal solution to problem ([T]). Conforti and Cornuejols 
[1] improved Q to 


(4) 


fjSgr) ^ 1 
f{Sopt) ~ a 



Gcc{T, a), 


for a E (0,1], where a E [0,1] is the total curvature 


a = max 


1 - 


fiX)-fiX\{x}) 

/(W) - /(0) 


:xEX,/({x})//(0) 


It is known that a E (0,1] if and only if / is nonadditive [3]. Clearly, 
Gnwt{T) = Gcc{T,'V) and since Gcc{T,a) —> 1 as a —>■ 0"'“, dH can 
be viewed as a generalization of the Rado-Edmonds theorem. The above 
performance guarantees further satisfy the estimates 


Gcc{T, a) > max 


^Gnwf{T), 


1 - e~^ ) 

a J 


> 1 - e 


-1 


for all a and T. The guarantees Q and ([3|) are tight for suitable choices of 
parameters T and a. For example, for all a E (0,1] and T > 1 there is a 
problem of the type m and the corresponding greedy solution Sgr such that 
fiSgr) = Gcc{T,a)f{Sopt) m 

Submodular optimization has played a central role in operations research 
and combinatorial optimization [8]. By now it has applications in various 
fields, including computer science |12) . economics |18) and, more recently, 
ecology (di, m, Cl)- Problem m and the above performance guarantees 
have been extended to various other settings and problem structures, re¬ 
lated to, for example, matroid and knapsack (im, |13| 1 constraints, 

continuous algorithms ([20], El), nonmonotone functions [5] , nonsubmodular 
functions m and supermodular minimization f|in). m)- 

To authors’ knowledge, previously presented performance guarantees ei¬ 
ther do not depend on T or n, or, like ([3|) and (HD, they are decreasing in T. 
However, when T = n, it is clear that Sopt = Sgr, so the greedy algorithm 
returns the optimal solution. This suggests that any performance guarantee 
should in fact be improving when T approaches and is close enough to n. We 
show that this is indeed the case. More generally, we show that increasing 
degree of overlap m = | Sopt H Sgr \ between the sets Sopt and Sgr improves 
the performance guarantees. While in applications the overlap m may not 
be known, we can give this quantity a useful lower bound. In fact, when 
T > n/2, we have m > 2T — n > 0. Our results thus have particular rele¬ 
vance for optimization problems where the maximum is sought over subsets 
of cardinality larger than n/2. 
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Let 


G{T, a, m) 




T—m 


and G{T, a, n) = G{T, a, max{0, 2T — n}). Our main result is the following. 


Theorem 1. Let a G (0, 1], letl <T < n and let Sopt o,nd Sgr be an optimal, 
repectively a greedy, solution to problem © and let m = \ Sopt O Sgr \. Then 

(5) > G(r, a, m) > G(r, a, n). 

Moreover, these bounds are tight in the following sense: for every a G (0,1] 
and numbers n and T such that 1 < T < n, there is a problem of the type 
and its greedy solution Sgr such that max{0, 2T — n} = \Sopt O Sgr\ and 


fjSgr) 

fiSopt) 


G{T, a, n). 


We postpone the proof of Theorem [T] to Section [2l 


Remark 2. Theorem [T] strictly improves dH and provides further examples 
of cases where the performance guarantee equals one, i.e., generalizations of 
the Rado-Edmonds theorem. Indeed, for all T and n such that T > n/2, we 
have 

G{T, a, n) > Gcc{T, a). 

For T = n, we get G{n, a, n) = 1. Note that, by dH, limQ,_^Q+ G(T, a, n) = 1. 
Moreover, in the case m = T, we again get G{T, a, T) = 1. 

Using Theorem [H one can derive other new performance guarantees for 
the greedy algorithm. As an example of independent interest, we present the 
following performance guarantee in terms of n only. 


Corollary 3. Let a G (0,1], 1 < T <n, and let Sopt ct'iT’d Sgr be an optimal, 
repectively a greedy, solution to problem dl|)- Then 


( 6 ) 


fjSpr) ^ 1 
f{Sopt) ~ a 





where [xj denotes the largest integer not greater than x. The left-hand esti¬ 
mate is tight in the following sense: for every a G (0,1] and n > 2, there is 
a problem of the type ® and its greedy solution Sgr such that 


fjSgr) 

fiSopt) 



Proof. If n is an odd integer, it is easy to check that the minimum of 
G{T,a,n) over all integers T with 0 < T < n is G((n—1)/2, a, n). Moreover, 
when treated as a continuous function of T, G{T, a, n) attains its minimum 
at T = n/2. Together with Theorem [1] this yields dS]). Tightness of the left- 
hand inequality in dS]) follows from Theorem [1] with the choice E = . □ 
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2. Proof of Theorem [T] 

In this section we present a proof of Theorem [TJ We first prove ([5|). Note 
that the right-hand inequality in Q follows directly from m = | Sopt H Sgr \ > 
max{0, 2T — n} and the fact that G'(T, a, m) is increasing in m. 

We next prove the left-hand inequality in ([5]). We may assume that 0 < 
m < T. Indeed, if m = T, then Sgr = Sopt and the claim is trivial. If m = 0, 
the claim follows from (|3]). 

Let S'o = 0 and St = {yi,... ,yt} C X be the successive sets chosen by 
the greedy algorithm for t = 1,..., T, so that S'o C Si C • • • C St- Let 

_ f{St) - f{St-i) 

fiSopt) ’ 

for t = 1,... ,T. Because / is nondecreasing, each at is nonnegative and 

fiSopt) 

Let J = Sgr n Scrpt- Let < ji < ■ ■ ■ < jm denote the indices for which 
= {%!) • ■ ■ yjm)- Denote jo = 0 and jm+i = T. By Lemma 5.1 of [1], we 
obtain the T inequalities 

l<a X] “*+ Z] ai + (r-|Si_in Sopt I )^t; 

yi^St—i\Sopt} yi^St—i(~^Sopty 


for t = 1,..., T. Consequently, 


fjSgr) 

fiSopt) 


> Bif), 


where J' = {ji,..., jm} and, for U C {1,... , n}, BfU) denotes the minimum 
of the linear program 


T 

(7) minimize y] bj 

i=l 

S.t. « Z Z bi + {T-\unVt-i\)bt>i, 

ieVt-i\u i£UnVt-i 

bt > 0 , 

for t = 1,...,T, where Vt = {!,...,f}. We next apply the proof of [U 
Lemma 5.2], which implies the following two facts: 

(i) UT ^U, then BiU) > Bi{T - \U HVt-iI, ... ,T}), 

(ii) .B({rT}) > B{{T-l + l,...,T}), for all 1 < / < T - 1. 

In particular, if jm < T, then B{J') > B{{T — m,..., T}) > Bi{T — m-\- 
1,... ,T}). Moreover, if jm = T, then B(J') = BiJ' \ {T}), so that using 
(i), B{.J') > B{{T - m + 1,..., T}). Consequently, 


fjSgr) 

fiSopt) 


> 




i=l 
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where h* = {b^,... ,b^) is an optimal solution to the problem ([7]) with U = 
{T — m + 1,... ,T}. By the weak duality theorem, we get that 

where c* = (c|,..., c^) is an optimal solution to the dual problem 


T 

(8) maximize q 

i=l 

T 

s.t. Tct + a Ci < 1, 1 < t < T — m 

T 

(2T — m + 1 — t)ct + Ci < 1, T — m + l<t<T 

2=t+l 

Cj > 0, i = 1,... ,T. 


Define the vector c = (ci,..., ct) by 


ct = 


_ (i_«) 


T—m—t 


T—m 


{2T—ra+l—t)(2T—m—t) ’ 


1 < t < T — m, 
T-m + l<t<r. 


An induction argument shows that c is a feasible solution of problem (j8|) 
(satisfying the T first constraints with an equality), so that 


T 

>^Ci. 

i=\ 

Moreover, it is easy to compute that 


fjSgr) 

fiSopt) 


and 


T—m 








T—m 


T 


Y = 

i=T—m+l 


m 

T’ 


which, after summation, yield the desired performance guarantee 

We next show the tightness of G{T, a, n) by modifying the proof of [H 
Theorem 5.4], Let 1 < T < re be any positive numbers. Pick any number 
1 < r < re/2, let X = {ai,..., o^, 6i,..., bn-r} and let /; 2^ —>■ M_|_ be the 
set function 

f{{ai^,...,ai^,bj^,...,bjj) = u+ (l “ ^ (l “ , 

k=l 
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defined for all subsets ,... ... ,bj^} C X. Then /(0) = 0. For 

any S = {ojj ,..., ai^,bj^,..., bj^} C X, where s < r and u < n — r, and 
tti £ X \ S, we have 

f{S U {a.}) - /(S) = (l - (l - > 0. 

For any S = {ojj ,... ,ai^,bj-^,... ,bj^} C X, where s < r and u < n — r, and 
bj £ X \ S, we have 

fisu{b,})-f(s)=i-^±{i-^y-'>o^ 

^ k=l 

By recalling that a set function 5 : 2^ —>■ M_|_ is submodular if and only if 

g{S U {x}) - g{S) > g{R U {x}) - g{R), 


for all S' C i? C X and x £ X \ R (e.g., |16|). these inequalities show that / 
is submodular and nondecreasing. Moreover, 


max n - : X € X,/({x}) / 0 

fiM) 

^ f{X)-f{X\{a,}) 

= '- im) — 


for any 1 < i < r, so / has total curvature a. 

Consider next the case where T > n/2. Set r = n — T, so that r < re/2 < T 
and n — r = T. It is easy to verify that Sopt = {&i, • • •, bx} is an optimal 
solution to problem ([T]) with f{Sopt) = T. Since /({oi}) = f{{bj}) = 1, 
for any 1 < j < T, the greedy algorithm can choose the element oi at the 
first iteration. Assume next that the greedy algorithm has chosen St-i = 
{oi,..., at_i} for some t < n — T. Using the fact 



it is easy to see that 


f{St-i U {aj) 




i—1 


i=l 


so the greedy algorithm can choose at at the tth iteration. We therefore can 
have Sgr = {oi,... an-T, bi,..., b 2 T-n}- This solution has the value 


fiSgr) 




The claim follows because rre = \Sopt H Sgr\ = 2T — re, whence we obtain 
n — T = T — m. 

The proof of case T < re/2 is easier, so we omit its proof. 
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